Un–complicating protein complex prediction
نویسندگان
چکیده
Identification of protein complexes from proteomic experiments is crucial to understand not only their function but also the principles of cellular organization. Advances in experimental techniques have enabled the construction of large–scale protein–protein interaction networks, and computational methods have been developed to analyze high–throughput data. In most cases several parameters are introduced that have to be trained before application. But how do we select the parameter values when there are no training data available? How many data do we need to properly train a method. How is the performance of a method affected when we incorrectly select the parameter values? The above questions, although important to determine the applicability of a method, are most of the time overlooked. We highlight the importance of such an analysis by investigating how limited knowledge, in the form of incomplete training data, affects the performance of parametric protein–complex prediction algorithms. Furthermore, we develop a simple non–parametric method that does not rely on the existence of training data and we compare it with the parametric alternatives. Using datasets from yeast and fly we demonstrate that parametric methods trained with limited data provide sub–optimal predictions, while our non–parametric method performs better or is on par with the parametric alternatives. Overall, our analysis questions, at least for the specific problem, whether parametric methods provide significantly better results than non–parametric ones to justify the additional effort for applying them.
منابع مشابه
Formulation and prediction of Dam's optimum capacity in un-gaged regions according to time analysis of store and GIS technique
Aim of this study is to estimate the optimized volume of reservoir in un-gaged basins and extract the estimation pattern of reservoir capacity for other regions. For this purpose, Ziarat was divided to 9 sub-basins using ArcGIS and the physiographical parameters was determined for each of them. for modelling of volume variation, demand before and after rain and also product before and after s...
متن کاملApplication of three graph Laplacian based semi-supervised learning methods to protein function prediction problem
Protein function prediction is the important problem in modern biology. In this paper, the un-normalized, symmetric normalized, and random walk graph Laplacian based semi-supervised learning methods will be applied to the integrated network combined from multiple networks to predict the functions of all yeast proteins in these multiple networks. These multiple networks are network created from ...
متن کاملTemplate Synthesis of Un-Symmetrical Tetradentate Schiff Base Complexes of Ni(II), Co(II), Zn(II) and X-ray Structure of Ni(II) Complex
Metal complexes, ML (M=Ni, Cu, Co and Zn) of unsymmetrical tetradentate schiff base have been synthesized by the template reaction of the half-units N-(1-hydroxy-2-acetonaphthone)-1-amino-2-phenyleneimine)(HL1) with 2-Pyrroleca-rbaldehyde. The complexes have been characterized by elemental analysis, IR, 1H NMR and UV spectroscopy. The crystal structure of the Ni(II) complex has been determine...
متن کاملMid-Trimester Amniotic Fluid High Sensitive C-Reactive Protein Level in the Prediction of Preterm Delivery
Background and Objectives: Preterm labor is a serious problem in obstetrics, accounting for 70% of perinatal mortality. High sensitive C - reactive protein (HS-CRP) is a sensitive marker of Inflammation. Our aim in this study was to determine Amniotic fluid hs-CRP concentration and its correlation with pre-Term delivery. Materials and methods: This prospective study was conducted on 90 p...
متن کاملInteraction Between Crude Protein Level and Rumen Protected Amino Acids in Starter Diet on Performance of Dairy Calves
BACKGROUND: A limited number of studies have investigated the inclusion of amino acids in calf starter. OBJECTIVES: This study was aimed to evaluate different levels of crude protein in starter and supplement- ing some protected amino acids on efficiency, health status and blood metabolites of dairy calves and to com- pare it with unprotected amino acids. <st...
متن کامل